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Cooperation: Procrastinating Policies 
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Abstract —This paper considers multiterminal networks with 
energy harvesting transmitter nodes that are also capable of wire¬ 
lessly transferring energy to or receiving energy from other nodes 
in the network. In particular, the jointly optimal transmit power 
and energy transfer policies that maximize sum-throughput for 
the two-way, two-hop, and multiple access channels are Identified. 
It is shown for nodes with infinite-sized hatterles that delaying 
energy transfers until energy is needed Immediately at the 
receiving node is sum-throughput optimal. Focusing on such 
procrastinating policies without loss of optimality, the stated joint 
optimization problem can be decomposed into energy transfer 
and consumed energy allocation problems which are solved in 
tandem. This decomposition is shown to hold for the finite-sized 
battery case as well, using partially procrastinating policies that 
avoid battery overflows. It is observed that for the two-hop 
channel, the proposed algorithm has a two fluid water-fllling 
interpretation, and for the multiple access channel, it reduces 
to a single transmitter problem with aggregate energy arrivals. 
Numerical results demonstrate the throughput Improvement with 
bi-directional energy cooperation over no cooperation and uni¬ 
directional cooperation. 

Index Terms —Energy harvesting networks, energy coopera¬ 
tion, procrastinating power policies, two-way channel, two-hop 
network, multiple access channel, finite energy storage. 

I. Introduction 

Energy harvesting wireless networks offer the possibility 
of perpetual network lifetime, yielding low maintenance costs 
and less energy storage requirements at each node |fTl-|[^. 
However, the intermittent availability of harvested energy also 
renders realizing the full potential of these benefits chal¬ 
lenging. In particular, it is possible for central nodes of the 
network to become energy deprived due to energy harvesting 
conditions, impacting the network’s performance. To combat 
such energy outages, recent results utilizing wireless energy 
transfer a offer the possibility of energy cooperation for 
wireless ad-hoc networks in addition to signal cooperation 
a-Q. In this paper, we consider a generalized setting in 
this realm, and study multi-transmitter models where nodes 
can receive or transfer energy, in order to gain insights into 
network design by jointly optimizing data and energy transfer 
policies. 

Optimal power allocation in energy harvesting wireless net¬ 
works has recently been studied extensively. In fS), an energy 
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harvesting transmitter with infinite energy storage capability is 
considered, and the transmission completion time minimizing 
power allocation in a single link is found. The short-term 
throughput maximization problem is studied in 13, where 
the setting of M is extended to finite energy storage. A 
wireless fading channel with an energy harvesting transmitter 
is considered in G3, showing that a directional water-filling 
algorithm can be utilized to find the optimal power allocation. 
Multiterminal models, and energy harvesting transmitters and 
receivers are subsequently studied, see for example m-Ea 
and references therein. In addition to optimal power policies 
found in ||8l- l20l . alternative power management approaches 
such as nodes with inactive states ll2Tll are also considered in 
previous work. 

Since energy harvesting networks may experience energy 
deprivation when sufficient energy is not available to harvest, 
they can benefit from the recent advances in wireless energy 
transfer. Short range energy transfer is already present in 
today’s RFID systems E2\ . Energy cooperation is a viable 
option in mid-range as well, with wireless energy transfer 
efficiency values reaching up to 40% using coupled magnetic 
resonance 0. This provides the possibility of energy coop¬ 
eration, allowing networks to have additional control over the 
energy available at each node, as proposed in ||5l. In essence, 
energy cooperation introduces a new dimension for network 
optimization in energy harvesting networks. 

The problem of optimizing energy consumption for data 
transmission and energy transfer is introduced in Q, where 
a two-hop network with an energy harvesting transmitter and 
relay is considered, and the source can transfer energy to the 
relay. It is shown that throughput can be improved with respect 
to energy harvesting alone m, even with uni-directional 
energy transfer. Two-way and multiple access channels with 
uni-directional energy cooperation are also studied by the 
same authors in ii, proposing a two-dimensional water-filling 
algorithm to find the optimal policy. These studies assume an 
infinite battery size for the transmitters. Additionally, a differ¬ 
ent line of work studies transferring energy and information 
jointly, see ll^ - ll^ and many others. 

In this paper, we follow the model of transferring energy 
and data separately as in m-m- We generalize this set up to 
energy harvesting nodes all of which are capable of transfer¬ 
ring energy to one another, i.e., in any direction. We consider a 
general energy transfer model, which could be realized via var¬ 
ious energy transfer technologies such as magnetic induction, 
magnetically coupled resonance, or RE harvesting. Whereas 
the recent paradigm of simultaneous wireless information and 
power transfer (SWIFT) focuses on harvesting RE signals for 
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powering devices, and the associated trade-offs between using 
the energy for device operation or information decoding, we 
focus on optimally allocating and sharing of energy between 
devices over time. As communication models, we consider 
those in EJ-Cl, i.e., two-way, two-hop, and multiple access 
channels, allowing unrestricted energy transfers between all 
nodes. In addition to generalizing earlier works to unrestricted 
energy transfers, we also extend these models to the case 
where all nodes have finite battery and establish the optimal 
policy when the battery sizes are finite. We will see that 
allowing unrestricted energy transfers and limited batteries 
both require a careful solution methodology and bring on new 
design insights. 

Specifically, we optimize transmit powers and energy trans¬ 
fers under the aforementioned general setting. For clarity of 
exposition, first, in Sections [III [Hll and IIVI we consider the 
two-way channel with infinite-sized batteries at the transmit¬ 
ters. We prove that a subset of feasible policies, composed 
of those which postpone energy transfers until immediately 
needed, includes an optimal policy. Named procrastinating 
policies, this subset allows a decomposition of the joint 
optimization problem into separate energy transfer allocation 
and consumed power allocation problems. We subsequently 
show that the separation extends to two-hop (Section |V]l and 
multiple access channels (Section IVTl l. We demonstrate that a 
generalized extension of the directional water-filling algorithm 
na for the two-way channel solves the power allocation 
problem, while a single-user policy as in M, with scaled 
aggregate arrivals, suffices for the multiple access channel. 
In Section IVIII we extend our study to transmitters with 
finite-sized batteries, and show that partially procrastinating 
policies, defined therein, are optimal. Next, we leverage a sim¬ 
plified version of the two-dimensional water-filling algorithm 
in Q to solve the throughput optimization problem with joint 
energy transfer and transmit powers. We present numerical 
results in Section IVIIII demonstrating the advantage of bi¬ 
directional energy cooperation in energy harvesting networks 
over no cooperation ii, m and uni-directional cooperation 
E-HI. Section HXl concludes the paper. 

II. The Energy Harvesting and Energy 
Cooperating Two-way Channel (EHEC-TWC) 

We will first focus on the two-way channel and solve the 
problem at hand. We will then extend our solutions to the 
two-hop and multiple access models in Sections |V] and IVII 
respectively. 

Consider the Gaussian two-way channel (TWC) ll2^ with 
two energy harvesting and energy cooperating (EHEC) nodes, 
Ti and T 2 , as shown in Eig.[T] Denoting the channel inputs by 
Xk and the channel power gains hy hk, k = 1,2, the channel 
outputs at nodes Ti and T 2 after self interference cancellation 
are given by 

Fi = ^2X2 + iVi, (1) 

F2 = VIhXi + N2, ( 2 ) 

where Nk is Gaussian noise with power tr^ at node Tk, 
k = 1,2. Each node cancels out its own contribution to the 



Fig. 1. Two-way channel with energy harvesting transmitters and energy 
cooperation. 

channel output, i.e., Tk subtracts Xk from Yk, thus reducing 
the model to two parallel additive white Gaussian noise 
(AWGN) channels with channel power gains hi and /i 2 - We 
consider a static, i.e., time-invariant channel where cr^ and 
hk, k = 1,2, remain constant throughout the transmission 
duration. 

The communication session is divided into Isec long time 
slotfl indexed hy i = 1,..., A^. Throughout the paper, we 
denote node indices by the first subscripts k, j and £, and 
time slot indices by the second subscripts i and n. In time 
slot i, node Tk, k = 1,2, harvests Ek,i units of energy, which 
it stores in its battery of size Within this time slot, Tk 

transmits with average power pk,i, which requires pk,i units 
of energy due to the unit slot length. In addition to harvesting 
energy, the nodes are also capable of transferring energy to 
each other. In time slot i, Tk transfers Sk,i units of energy 
to Tj, j ^ k. This transfer has an end-to-end efficiency of 
ctk < 1. and Tj receives akSk,i units of energy as a result. 
The end-to-end transfer efficiency includes propagation loss, 
as well as other factors that scale linearly with the amount 
of energy transferred, e.g., circuit energy consumption at both 
parties. The power policy of the network is defined as the 
collection of transmit powers and transferred energy values 
{Pk,i,5k,i} for all k, j and i. 

The power policy of the network is constrained by the 
energy available to each node in each time slot. In particular, 
the energy remaining in the battery of node Tk depends on the 
energy harvested at node Tk, the transmit power of the node, 
and the energy transferred to and received from node Tj for 
j ^ k. We refer to the energy stored in the battery of Tk at 
the end of time slot i as the battery state Sk %, which evolves 
a^ 

Sk,i — min {Ek , Sk,i—1 T Ek,i Pk,i ^k,i Y , 

(3) 

for fc = 1, 2, with Sk,o = 0. The initial charge of the batteries 

'This choice is for simplicity. The results readily extend to slots with 
arbitrary length. 

^We note that energy harvests and transfers are not instantaneous, but 
take place throughout the respective time slot. Since energy consumption 
via transmission is also distributed over the time slot, it is possible, and 
mathematically convenient, to express energy values as arriving to or departing 
from the node at the beginning of the slot. 
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are introduced to the model through the energy harvests Ei^i 
and E2.1 in the first time slot. To ensure that the energy used 
by Tfe does not exceed the energy available at the node, we 
impose the set of energy causality constraints il, 13 

Pk,i + Sk,i ^ Sky -1 + Eky + CXjSjy, (4) 

for k = 1,2, i = which can equivalently be 

expressed as Sky > 0 for A: = 1, 2 and i = 1,... ,N. These 
causality constraints imply that energy cannot be consumed, 
neither for transmission nor for transfer, before it is harvested 
or received. 

For average transmit powers piy and p2y in time slot i, the 
sum-capacity of the Gaussian two-way channel in Fig. [1] with 
full-duplex nodes is given by 1291 

{piy,P2y) = i log (1 + i log (1 + . 

(5) 

We consider the offline problem (see also Q-Q, ifTTl . ifT^ . 
03), where the energy harvests Eky throughout the session 
are known at the beginning of the communication session. 
In addition to being applicable in networks with predictable 
energy arrivals, this approach also allows us to benchmark the 
performance limits of energy harvesting networks with energy 
cooperation. 

We consider the EHEC sum-throughput maximization prob¬ 
lem for the TWC over transmit powers pky and energy 
transfers 6 ky, throughout a communication session of N time 
slots, i.e., 

N 

max '^Cs^^{piy,p2y) (6a) 

i—l 

s.t. Sk,^>0, k = l,2, i = 

(6b) 

Pky > 0, Sky >0, k = l,2, i = (6c) 

where (l6Ft are the energy causality constraints, and (l6cl i are the 
non-negativity constraints for transmit power and transferred 
energy. We remark that similar problems that consider other 
criteria such as fairness can be formulated by updating the 
objective of (|6]l accordingly. 

We note that (|6]l is either solved by all nodes in the network 
separately, or solved by one of the nodes and the output 
communicated to the others. This requires energy harvests and 
channel parameters to be shared between all energy harvesting 
transmitters in the system. The relatively small communication 
overhead, which is in the order of a few bytes per time slot, 
will be omitted in this work for the sake of simplicity. 

Lastly, we remark that while linear end-to-end energy 
transfer losses are represented by {ak}, we do not explicitly 
account for other circuit or processing energy costs for energy 
cooperation. These can be incorporated into the optimization 
problem by embedding their cost models into ([3]l. Currently, 
experimental such models are being developed for specific 
transfer technologies, see for example H for strongly coupled 
magnetic resonance as the energy transfer technology. 


III. Properties oe Optimal Policies for EHEC-TWC 
WITH Infinite Batteries 

We begin with the infinite battery case, E^°‘^ = 00 . In 
addition to providing insights about the optimal policy, this 
case is also a good approximation for systems where battery 
capacity is sufficiently large, harvested energy is sufficiently 
low, or transmission session is sufficiently brief with respect 
to harvesting period. The properties found in this section are 
extended to the finite battery case in Section IVlIl Eor this case, 
the battery state in (O can be rewritten as 

i 

Sk,i — ^ ^ {^k,n Pk,n ^k,n) i ( 7 ) 

n—1 

for k^j = 1, 2 and j ^ k. 

The problem in (IHi involves the joint optimization of 
transferred energy and transmit powers of the two nodes over 
N time slots, i.e., AN variables in total. In this section, we 
identify properties of optimal policies, the use of which helps 
us eliminate the additional complexity introduced by energy 
cooperation. 

A. Procrastinating Policies 

We first show that a subset of power policies, named 
procrastinating policies, includes at least one optimal policy. 

Definition 1 A power policy {pkyiSky} is a procrastinating 
policy if it satisfies 

Pky ^jSjy ^0; j) ^ j ^ ^ — 1; • ■ • 5 Sf. ( 8 ) 

In each time slot, a procrastinating policy transfers energy 
from one node to the other only if all of the transferred energy 
is to be consumed for transmission immediately. This can be 
interpreted as the energy transferring nodes delaying energy 
transfers until the time slot they are immediately needed at the 
receiving end, hence the name procrastinating policies. In a 
procrastinating policy, energy transfers that are necessary for 
the feasibility of pky are postponed until the conditions in (l8]l 
are satisfied. The following lemma establishes the optimality 
of procrastinating policies. 

Lemma 1 There exists at least one procrastinating policy that 
is a solution o/®. 

Proof: Let i:Ski\ be an optimal policy which is not 
procrastinating, i.e., there exists Pkn"^ ^j^*j n for some n, 
j and k, j f k. Starting from f = 1, if ■ < 01 jS*^, set 
^ and update 5*,^^ = 5*,^^ + (1 - 

and 5*^ = pS *This postpones excess transferred energy 
to the next time slot whenever dSll is violated. Note that 
the update in time slot i only affects Sky and Sjy in (l6bl l. 
decreasing the former by ajS* i—p^i and increasing the latter 
by (5*j —p*kiloij. However, since Sky > 0 in the original 
policy and j = 0 from Lemma |2] this change does not 
violate (1601. Repeating the updates for i = 2,... ,N and 
fc = 1,2 yields a feasible procrastinating policy. Meanwhile, 
since pj ^ is unchanged, the objective in (l6al) is unchanged, and 
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therefore the resulting procrastinating policy is also optimal. 

■ 

Lemma [T] shows that by delaying energy transfers unless 
immediately required for transmission, any feasible transmit 
power policy {pk,i} can be realized with a procrastinating 
policy. Next, we utilize this property to decompose (|6]l into 
two subproblems regarding the energy harvesting and energy 
cooperation aspects of the original problem. 

B. Decomposition to Energy Transfer and Power Allocation 
Problems 

We define the consumed power pk,i as the power drawn 
from the battery of node Tk, taking both transmission and 
transfers in time slot i into consideration. This term is ex¬ 
pressed as 

Pk,i — Pk,i “t” ^k,i (^) 

Note that by definition, consumed power can be negative. 
However, a procrastinating policy, as defined in Definition [T] 
satisfies pk^i > 0 for all i = 1,..., iV and k = 1,2. We first 
present the following lemma. 

Lemma 2 There exists an optimal policy which satisfies 
Sk,iSj,i = 0 for all k,j = 1,2, j ^ k, i = 1,...,W i.e., 
energy transfer is never in both directions in a given time 
slot. 


Proof: Let be an optimal policy. Define 

5k,i = j for all k,j = 1,2, j k, and 

n = 1,..., iV, which satisfies 5k,i5j,i = 0. With these energy 
transfers, Sk,i in (|7]) increases for all k and i, and therefore 
the procrastinating policy j, 5j,i} is feasible. Since ^ are 
unchanged, it also yields the same objective as {p^ j, 5 *and 
is therefore optimal. ■ 

Lemma |2] is a natural consequence of transfer efficien¬ 
cies being less than 100%, and is intuitively pleasing. As 
a consequence of the lemma, we can restrict our attention 
to policies satisfying the lemma without loss of optimality. 
Hence, for procrastinating policies satisfying the lemma, the 
non-negativity constraints pk,i > 0 in (l6cl) are equivalent to 
Pk,i > 5k,i, i = 1,. ■., N, k = 1,2. Restricting the feasible set 
of (|6ll to procrastinating policies satisfying Lemma |2] without 
loss of optimality, we rewrite (|6]l as 


max 

{Pk,i,Slc,i} 


N 

E 

i=l 


c 


•TWC ( 


Pk,i + ak5k,i — 5j,i 


(10a) 


S.t. ^ ^ k5jk,n Pk,n ^0; ^ 2, i — 1, . . . , IV, 

n—1 

(lOb) 

Pk,i ^ 5k,i, 5k,i ^0, k — 1,2, % — 1, . . . , N. 

(lOc) 


Here, [pk,i\ = (pi.ijP 2 .i) denotes both parameters of 
which are found by substituting fc = 1, 2 and j ^ k. Note that 
the constraints in (Ihht , which include both energy transfers 5k,i 
and transmit powers pk,i, are replaced with (llObl i- dlOcl i where 
energy transfers and consumed powers are now decoupled. 


Furthermore, the ith summation term in the objective (IIPal i 
depends only on the variables for the respective time slot i. 
Hence, (fTol l can be decomposed as 


N 

max y^R{pi,i,P 2 ,i) 

{Pk,A 

i 

s.t. ^ ^ BIk,n Pk,n ^ 0 
n—1 

Pk,i > 0. 


(11a) 

= 1,2, 1 = 1,...,N, (11b) 
k = 1,2, i = l,...,N, (11c) 


where R{pi,i,P 2 ,i) is the per-slot sum-rate for consumed 
powers Pi i and p 2 ,i, defined as 


R{pi,i,P 2 ,i) = max {[pk,i +ak5k,i 

S-t- Pk,i ^ ^k,i E 0; ^ 


(12a) 
1,2. (12b) 


Note that (fT2t yields the optimal energy transfers 5k,i within 
a single time slot i for a fixed pair of consumed powers 
{pi,i,P 2 ,i)- Being separated from 5k,i, (fTTT l finds the optimal 
allocation of consumed powers pk,i, i = 1,..., N throughout 
the transmission. This decomposition implies that the power 
transfer optimization can be performed separately and in a 
slot-by-slot basis, i.e., the optimal energy transfers 5k,i can be 
found using only the consumed powers pk,i in the same time 
slot. 


Lemma 3 i?(7ri,7r2), is jointly concave in tti and tt 2 . 


Proof: The proof can be found in Appendix lAl ■ 

As a result of Lemma [3 the consumed power allocation 
problem in (fTTI) is a convex program. Furthermore, the con¬ 
straints in (II Ibl) and (IIIcl l are separable among transmitters 
k = 1,2, and hence a block coordinate descent (alternating 
maximization) algorithm that alternates between {pi,i} and 
{P 2 ,i} converges to the optimal policy ll^ . In particular, at 
each iteration, a single transmitter problem with a concave 
objective function and linear energy causality constraints is 
solved. The iterations evolve, alternating over the optimized 
variables, until the policies converge. Namely, we solve 


N 


max 

{Pk,i} 

'^R{Pl,i,P2,z) 


(13a) 

s.t. 

i 

^ ^ ^k,n Pk,n ^ 0; 

TL — 1 

1 = 1,...,N, 

(Ob) 


Pk,i > 0, 

i = l,...,N, 

(13c) 


for a fixed k at each iteration, alternating between k = 1 
and k = 2, while {pj,i}, j k, is held constant. Note that 
(foi l differs from its counterpart without energy cooperation ifSl 
only in the rate function R{pi,i,p 2 ,i). However, pj,i, j k, 
may change in time, and hence the solution to each iteration 
step in (foi l is not the constant power policy in IQ. Instead, 
it can be found using a generalized directional water-filling 
algorithm, as we will describe next. 
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IV. Optimal Policy for the EHEC-TWC with 
Infinite Batteries 

The decomposition in (fTTT i- lfTSl i simplifies the analysis of 
the problem by separating the power allocation problem from 
energy transfer variables {5k,i}, and calculating optimal en¬ 
ergy transfers in a slot-by-slot basis. We first solve the energy 
transfer problem within a single slot, i.e., (fT^ . which we then 
substitute in ( fOl l to solve (fTTl l. 


A. Optimal Energy Transfers for the EHEC-TWC 

Consider time slot i first. We focus on the two subsets of the 
feasible space of (fT^ . namely those satisfying 5i,i = 0 and 
52,i = 0, one of which contains an optimal policy as implied 
by Lemma |2] We solve (fTZt for these subsets, and choose the 
maximum of the two. Eor the policies satisfying 5j,i = 0, the 
solution to (fTZt is found as 



for k f j, where [x\^ denotes max{0,a;}. This yields two 
optimal transfer candidates, <5* ^ and 5 ^,, each requiring the 
other to be zero. Note that the case where both candidates 
are positive, i.e., 5{ ^ > 0 and 5^ ^ >0, requires aia 2 > 1, 
which is not possible since ai,a 2 < 1 by definition. Hence, 
at least one of the two candidates is always zero, and (fl4l l 
immediately gives the solution to (fTSl i. The per-slot sum-rate 
achieved by the optimal energy transfer policy, corresponding 
to R{pi,i,p2,i) in (HD (fTZt . is then expressed as 


R{Pl,i,P2,i) = 


Cs^^{pi,i,P2,z), 



dh = d'. 

* 

2,i 

= 0 , 

log ( 

\/aihih2 

2cri(T2 

{{^+Pw) + 

1 

Oil 

(eI 

\h2 









0 < 

< 

Pi,i, 

log ( 

yyct2hih2 

2(71(72 

((S +P2.^) + 

1 

0:2 

(-1 

\hi 

+ pm))). 








0 < dl. 

< 

P2,i, 

5 log 


{p2,i + aipi,i)] 

? 


0 < 5^, 

= 

Pl,i, 

I log 


{pi,^ -f a2p2,i)] 

5 


0 < <)2*.. 

= 

P2,i- 


(15) 


B. Optimal Power Allocation for the EHEC-TWC 


and sufficient for optimality, and are found as 


dR{pi,i,P2,i) 

dpk,i 


N 

+ ^ ^ — Tk,i = 0, 

n—i 


(16) 


5<k,iSk,i — Oj '^k,iPk,i — 0; (17) 

for k,j = 1, 2, j k, and i = 1,... ,N. Here, Xk,i > 0 and 
Tk,i > 0 are the Lagrange multipliers for the constraints in 
(I13bl l and dOcl l, respectively. Eor the optimal {pk,i} solving 
(HD, there exists a set of non-negative Lagrangian multipliers 
that satisfy the conditions in (fTfil l- (fTTl l. and vice versa. We 
define 


'^k,i — 


f dR{pi,i,p2,i) 

V dpk,i 


-1 


(18) 


Erom (HD, we observe that whenever pk,i > 0, we have 
Tk,i = 0. In this case, from (fTfil l, we see that optimal {vk,i} 
are constant in i unless Xk,i > 0. Meanwhile, a positive Xk,i 
is only possible when Sk,i = 0, i.e., the battery is empty. 
Moreover, since Xk,i are non-negative, optimal {vk,i} are non¬ 
decreasing in i. Due to this behavior, we refer to {vk,i} as 
generalized water levels, and utilize the directional water¬ 
filling interpretation in ifTOl to find the optimal water levels. 

In particular, in each iteration for Tk, transmit powers and 
water levels are initialized by setting pk,i = Ek,i for all i = 
1,..., TV. If the water levels satisfy Vk,i > Vk,i+i for some i, 
this results in water (energy) flow from slot i to slot i -f 1, 
which is achieved by decreasing pk,i and increasing pk,i+i by 
the same amount. The flow stops when the two water levels 
are equalized, or when pk,i = 0. In the latter case, (m is 
satisfied via some Tk,i > 0, which is feasible since pk,i = 0. 
Elow in the reverse direction, e.g., from time slot i -f 1 to i, is 
not allowed since it violates the energy causality constraints 
in dfibl l. The algorithm terminates when water levels do not 
permit any water flow. 

The generalized directional water-filling algorithm above is 
then repeated at each iteration. Eor the iterations on pk,i, where 
pj,i, j k is kept constant, the water levels are given by 


Vk,i = < 


^ (lit + Pfe-*) ^ 

(lit + ^^(717 ’ 

(lit (tIJ > 

2 (pk,i + ^^(717 ’ 

{it + Pk,^ + 0‘jPj,t) . 


0 < <Pk, 

0 < 5* <pj, 

0 < 5*=pk, 

0 < 5* = pj. 

(19) 


Substituting (fTsT i in (fTTl l. it remains to solve for the optimal 
{Pk,i} by iterating between pi,i and p 2 ,i. We now show that 
the solution to each iteration admits a generalized directional 
water-filling interpretation, and consequently (|6]l can be solved 
using the generalized iterative directional water-filling algo¬ 
rithm Ifm. ifTIl. 

As shown in Lemma [3 ( fTTl l is a convex program with 
affine constraints. We also remark that i?(7ri,7r2) in (fTSl l is 
a continuously differentiable function in both tti and 7r2. 
Hence, the KKT optimality conditions of (fTTl l are necessary 


We present an example of the directional water-filling 
algorithm for TV = 4 time slots of length 1 sec and ai = 
a 2 = 0.5 in Eig. |2] Energy arrivals to the two nodes are 
El = [2,5,0,0] mJ and E 2 = [0,4,0,7] mJ. The final 
(equilibrium) water levels are shown in blue for node Ti, 
and in green for node T 2 , for hi — h 2 = —100 dB, and 
af — = 10“^^ W for a 1 MHz bandwidth. Observe that 

in the first time slot, pi,i — 2 mW and p 2 ,i = 0 yields the 
optimal energy transfers <5* = 1 mJ and 1 = 0, i.e., node 
Ti transfers 1 mJ of energy to node T 2 as indicated with 
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Fig. 2. Optimal water levels found by iterative generalized water-tilling with 
water levels in GD. 


the red arrow. The energy transfer candidates for time slots 
1 = 2,3 are zero, and no energy is transferred. In the last time 
slot, the optimal energy transfer rate is found as (5^ 4 = 0 and 
i 52 4 = 2 mJ, i.e., the energy transfer is from node T 2 to node 
Ti. With the final water levels in the figure, no further water 
flow is feasible for either node. 


V. The EHEC Two-Hop Channel 


We next consider a two-hop channel (THC) with infinite¬ 
sized batteries as a simple example of a multi-hop setting as 
done in 0 , extended to bi-directional energy transfers. The 
channel model is as shown in Eig. |3] Eor this case, we denote 
the transmit power of the source node by pi i, the relay node 
by P 2 ,a and the source-relay and relay-destination channel 
power gains by hi and / 12 , respectively. The messages are 
delay constrained, and the relay node T 2 needs to forward all 
received messages immediately to the destination. As such, 
the relay does not have a data buffer, and departs packets in 
the same time slot they are received. The source and the relay 
are both capable of energy transfer. The sum-capacity for this 
channel with a full-duplex relay is given by 


Cs^'^iPi,i,P 2 ,i) = min 







( 20 ) 


Note that as in the two-way model, Cjis jointly concave in 
Pi i and p 2 ,i since it is the minimum of two jointly concave 
functions. Hence, the throughput maximization problem for 
this channel also satisfies Lemma [T] and therefore allows the 
decomposition in (fTTT i- ffTSl i. 



Fig. 3. The two-hop channel model with energy harvesting transmitters and 
relay, and energy cooperation. 


A. Optimal Energy Transfers for the EHEC-THC 

Given the capacity expression in (l20l l. the two-hop version 
of (fT2l l can be written as 

max mini H {pl,i + alSl^^-S2,i ), ^ (p2,*+a2^2.i-^i,i) 1 

( 21 a) 

S.t. Pk,i ^ b; ^ (21b) 


The objective is the minimum of two linear functions, and the 
two terms of the minimum change in opposite directions with 
or ( 52 , 1 . Hence, the optimal is attained when the two terms 
are equal, if feasible. Solving (1211 1 for (5i,i and ^ 2 ,^ satisfying 


— (pi,i + Cti(5i,i — (52,i) — — (p2,i + Ck2<52,i 


yields the energy transfers 


i5i,i — 
<52,1 = 


CTi/tipi,, - alh2P2,i 
Q!i(t|/i2 + crfhi 
(ylh2P2,% - crlhipi^^ 
a2crihi -f tT|/i2 


4 -I- 


<5i,*) (22) 


(23a) 

(23b) 


where [x]+ denotes max{0,x}. Since Q!i,a 2 > 0, these 

energy transfer values are feasible and therefore optimal. 

Observe that due to (l23T l. the difference of received powers, 

i.e., ^pi i — ^p 2 i determines the direction of energy transfer, 
^2 ’ '^1 ’ 

and the transferred energy is nonzero unless the two received 
powers are equal. 


B. Optimal Power Allocation for the EHEC-THC 

Substituting the optimal values in (l2Tt into the power 
allocation problem in (fTTT i yields 


max 

{Pk,i} 


N 

log ^1 -f hih 2 min 

i=l 


{ 


Pl,i+(y2P2,i 

OL2<y‘ihl-\-(T2h2 ’ 


QlPl,i+P2, 

cr‘(hi-\-cxia‘2h2 


fe}) 


(24a) 


I 

s.t. y ^ Pk,n ^ Oj Pk^i ^0; Z = 1,...,V. 

n—1 

(24b) 


Due to the convexity of the problem, the generalized iterative 
water-filling algorithm in Section IIV-BI can also be used for 
(l24l i by solving (l24l i iteratively in and {p 2 ,i}- In this 

model, the generalized water levels are found for the {pk,i} 
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iteration, keeping j ^ k constant, as 

, cFlhkpk,i < a'jhjPj^i, 


We remark that the water levels are linear in transmit powers, 
and therefore the algorithm resembles conventional water- 
hlling II 3 TI . In the iteration on {pk,i}, the consumed powers 
^ ^te kept constant, which introduces a base level 
over which water-hlling is performed. The hrst terms in (l25T l 
are consumed powers, the second terms are base levels due to 
the other transmitter, and the third terms are constant. 

We further remark that water levels vi^i and V 2 ,i are linearly 
related, with ratio a 2 or ai depending on the direction of 
energy transfer. As a consequence, unless ai = a 2 = 1, if 
the water levels in two consecutive time slots are equal for 
both transmitters, the direction of energy transfer must remain 
the same as well. An insight that can be drawn from this 
observation is that the direction of energy transfer remains 
unchanged in time unless the water levels change for one of 
the nodes, which only occurs when the respective node is out 
of energy. 

Combining these two remarks, we observe that the gen¬ 
eralized directional water-hlling algorithm has an intuitive 
two-huid interpretation for the two-hop channel. Namely, we 
can solve (EUl by considering pi^i and p 2 .i as levels of two 
immiscible fluids, and scaling these Huids appropriately in 
each iteration while performing directional water-hlling based 
on the total water level. The analogy is even more apparent 
in the uni-directional energy transfer case, where a 2 = 0. 
This gives V2,i = 00 whenever afhipi^i < cr|/i2P2,i, thus 
restricting the solution to afhipii > (T|/i 2 P 2 ,i- With this 
restriction, we have V 2 ^i = and therefore no iteration 

is necessary. The optimal consumed powers are found as 
the resulting water levels when both Huids are allowed to 
How while satisfying the condition afkipi^i > fT|/i 2 P 2 ,i- An 
example to this two-huid water-hlling is depicted in Fig. |4] for 
a\ = 0.5, a 2 = 0, Ei^i = [4, 0, 2, 6] mJ, i? 2 ,i = [0, 3,0, 0] mJ, 
and the same channel parameters in Fig. |2] Note that in this 
example, water How for node T 2 (green) from i = 2 to i = 3 
occurs, even against the level gradient, until the condition 
o’ihipi _2 > cr|h 2 P 2,2 is satished. 


crlhkPk,i > cr^hjPj^i. 

(25) 


'^k,i — ‘ 


Pk,^ + ajPj,^ -f 


Pk, 


Pj,i 

Ckfe 


tij 


VI. The EHEC Multiple Access Channel 


In this section, we extend the results in Sections UniandllV] 
to the Gaussian multiple access channel (MAC) with inhnite- 
sized batteries, shown in Eig. |5] The sum-capacity for this 
channel in time slot i is given by 


{pi^i,P 2 ,i) = 2 log ( 1 -p 


hiPi,i h 2 P 2 ,, 


(26) 


The corresponding sum-throughput maximization problem, 
i.e., the MAC version of (|6|l, is 


N 

max '^C^^^{pi^i,p 2 ,i) (27a) 

1—1 



Fig. 4. Directional water-filling (a) initial levels and (b) levels after water 
flow, for an example setting with N = 4. The blue, green and orange areas 
correspond to source consumption pi, relay consumption p 2 and the constant 
term in IlD, respectively. 


s.t. S'fc,i>0, Pfc,i>0, 4,J>0, k = l, 2 , i = 

(27b) 

Since is also jointly concave in pi ^ and p 2 ,u the 

MAC sum-throughput maximization problem also satishes 
Lemma [T] yielding the decomposition in (dU-Clll. In the 
following subsections, we address the energy transfer and 
power allocation subproblems for the MAC. 


A. Optimal Energy Transfers for the EHEC-MAC 

Substituting the consumed powers in (|9|l into (l27l l yields 


max 

{Pk,i} 


N 

r<MAC 
'-'S 

i=l 



Pk,i tXk^k,i 


i 

s.t. y ^ Ek^n Pk,n ^0; k = 1,2, i = 1, 

n—1 


(28a) 

N, 

(28b) 


where ^ and 62 ,i are found as the solution to the energy 
transfer problem 


^ f ^2 



h2\ 

max dii [ ai—^ 


+ 62,i 1 Q;2—2 “ 

- — 

’ \ 


V <^2 



s.t. 0 < 6 k,i < Pk.i, k=l,2. 


(29a) 

(29b) 


Note that (|2^ is a linear program, with the optimal achieved 
at a corner of the rectangle dehned by (I29bl i. The optimal 
policy is to choose 4,i = Pk,i if dkO'jhj > cr'^hk, and choose 
6 k,i = 0 otherwise. Consequently, the allocated power at Tk is 
entirely transferred to Tj if akCTjhj > cr'f.hk, or is entirely used 
for transmission if akCTjhj < cr^hk- This also implies that 
energy transfers only depend on the channel parameters, and 
hence the optimal energy transfer direction remains the same 
throughout the transmission. As a result, uni-directional energy 
transfer is sufficient from one user to the other in the direction 
determined by the channels and their transfer efficiency values. 
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Fig. 5. K-transmitter multiple access channel with energy harvesting 
transmitters and energy cooperation. 



Fig. 6. Optimal sum-power policy for an energy cooperating MAC for a 
deadline of = 4 time slots. 


B. Optimal Power Allocation for the EHEC-MAC 


The analysis in Section IVI-AI reveals that in the optimal 
policy, either no energy transfer occurs, or one node transfers 
all of its energy to the other. In the former case, we get an 
energy harvesting MAC without energy transfers, the sum- 
capacity of which was found in ifTTIl . The problem is solved by 
combining harvested energy in a single pool, thus reducing the 
problem to the single link power allocation problem in ifSl. In 
the latter case, let a 2 Uihi > cr|/l 2 without loss of generality. 
Then, the optimal energy transfers in Section IVI-AI yield the 
water-levels 

2 2 - 
^2 - - <^2 Pli - 

Vl,i = — + Pl,i + a2P2,i, V2,i = - - 1 - -+P2,i- 

hi hia2 C(2 


(30) 


Note that vi^i = a 2 V 2 ,i- In this case, we can equivalently 
consider the policy of node Ti only, and transfer all energy 
harvested by node T 2 immediately to node Ti. We establish 
this by scaling {£’ 2 ,^} with the end-to-end efficiency of the 
transfer, a 2 , and adding them to the harvests of the trans¬ 
mitting node, {£1 i}. The resulting problem consists of a 
single energy harvesting link, which can be solved as in fSl. 
Therefore, in both cases, the power allocation problem reduces 
to that of a single link. In order to generalize the solution to 
all cases, we define 


aj = max 


/ ak<j]hj \ 

[ ’ <^lhk ) ’ 


(31) 


and find the optimal power allocation policy as the solution 
to the single user problem with equivalent energy harvests 


£, = at£i,, + a*£2.z. 


(32) 


The solution is a piecewise constant, non-decreasing sum- 
power policy, in which the sum-power only changes when all 
batteries are depleted. A depiction of the optimal sum-power 
policy is presented in Fig. |6] for N — 4 time slots, where 
the staircase represents the cumulative harvested energy and 
the piecewise linear curve represents the cumulative consumed 
energy. 


Remark 1 Optimality of procrastinating policies also extends 
to channels with more than two transmitters, such as the K- 
user MAC, as shown in In this case. Lemma \2} extends 
to not transferring and receiving energy simultaneously, re¬ 


gardless of the direction 4521 Lemma 1 ]. The procrastination 
condition extends to the sum of energy transfers arriving to 
a node 4521 Eqn 12], This allows the iterative directional 
water-filling algorithms above to be used for such models, 
by iterating over all transmitters. Due to space restrictions, 
we refer the reader to 4521? for details. 

VII. Optimal Policies for Nodes with Finite-sized 
Batteries 

We now extend our model, formulation, and solution to 
nodes with finite-sized batteries. In Lemma [T] it is shown 
that an optimal procrastinating policy exists if the batteries 
are infinite-sized. This is justified by always being able to 
postpone energy transfers which are not consumed within the 
same time slot, i.e., which do not satisfy ([8]). In the case 
of finite-sized batteries, this argument is no longer sufficient, 
since postponing energy transfers from Tk to Tj may yield a 
battery overflow at Tk that the original energy transfer policy 
would have avoided. In this section, we provide a class of 
policies that procrastinate to the point they can avoid such 
overflows, and show that they are optimal policies for the 
EHEC two-way channel with finite-sized batteries. 

Consider the finite battery two-way channel model in Sec¬ 
tion ini i.e., E^°‘^ < 00 in 0. We first reiterate that the 
optimal policy should not cause any battery overflows. This is 
an extension of Lemma 2], which states that a power policy 
that yields a battery overflow is suboptimal. In particular, 
energy overflow in time slot i can be avoided by consuming 
more energy in time slot i — 1. This strictly increases the 
sum-throughput in time slot i — 1, and does not affect the 
battery state Skj for j = i,. .. ,N. Therefore, without loss 
of optimality, we restrict our attention to policies that do 
not cause energy overflows. We use (|2l), while imposing the 
constraint Sk,i < in (|6ll. The equivalent sum-throughput 

maximization problem for a TWC with finite-sized batteries 
is 

N 

max VcJ^'^(pi,*,P 2 ,i) (33a) 

{pk.iAk.i} ^ 

2—1 

s.t. > Sk,i > 0, /c = 1,2, ? = (33b) 

Pk,i > 0, 4,2 >0, fc = 1,2, i = 1,..., V, (33c) 
where Sk,i is given by (|7]). 
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A. Partially Procrastinating Policies 

We next modify the set of procrastinating policies to pre¬ 
vent energy overflows. We begin by splitting Sk,i into two 
components, yk,i > 0 and ek,i > 0 , as 

^k,i = 7fc,i + ^k,i- (34) 


In dlOll, we have selectively imposed the partial procrastination 
conditions (l3l and (l36T l without loss of optimality due to 
Lemma |4] Note that (I40al l and (I40cl l are independent of ek,i, 
while (I40bl i and (I40dl i are independent of 7 ^ i. Moreover, the 
ith summation term in (I40al i depends on pky, yk,i, and ek,i 
only, allowing us to decompose (l40l i as 


These components represent the portion of the transferred 
energy that is consumed immediately, and the excess portion 
that is stored for future use, respectively. Clearly, power 

policies defined as include all feasible power 

policies for (| 6 ll. Based on these variables, we define partially 
procrastinating policies, which are an extension of procrasti¬ 
nating policies in Section IIII-AI as follows: 

Definition 2 A power policy {pk,i,Jk,i,£k,i} A a partially 
procrastinating policy if it satisfies 

Pk,i - > 0, (35) 

7 i,i 72 ,z = 0, (36) 

ek,^ - Sk,i) = 0 , (37) 

for k, j = 1,2, j k, and i = 1,..., N. 

In a partially procrastinating policy, the condition for pro¬ 
crastination, i.e., (O, is restricted to the immediately consumed 
component 'yk,i, as seen in (l35ll. Meanwhile, the excess 
component ek,i can only be nonzero if the battery of Tk is 
full, i.e., Sk,i = 75™“^, as dictated by dJTl i. This component 
allows transferring the excess energy that would otherwise be 
lost due to battery overflows. We next show that there exists 
at least one optimal policy that is partially procrastinating. 

Lemma 4 There exists a partially procrastinating policy 
{Pk,iy lk,ii Efe.i} such that the transferred energy values 
calculated from f li4l ) and the transmit powers {pk,i} solve ( 1351 ). 

Proof: The proof can be found in Appendix |B] ■ 


B. Finding the Optimal Power Policy 

We update the definition of consumed powers in (|9l) as 


Pk,i — Pk,i 


(38) 


for k,j = 1,2, j f k, and i = 1,... ,N. Substituting in (|7]l, 
this yields the battery state 

i 

Sk,i — ^ ^ {Ek,n CXjCj^n Pk,n) • (39) 

n—1 

We next rewrite (l33] t in terms of pk^i, ^k,i, and ek,i as 

N 

max ^ {[pk^i - ^k,i + 

{pk,i,'yk,i,ek,i} 

1—1 


(40a) 


s.t. > Sk,i > 0, k = l,2, i = l,...,N, 

(40b) 

Pk,i > lk,i > 0, 7i.i72.i = 0, k = l,2, i = l,...,N, 

(40c) 

Pfc.i > 0, > 0, k = l,2, i = l,...,N. (40d) 


N 

max 'Y' R{pi^i,p 2 ,i) (41a) 

1 — 1 

s.t. > Sk,^ > 0, k = l,2, i=l,...,N, (Alh) 

Pk,i > 0, ek,i >0, k = l,2, i = l,...,N, (41c) 


where R{pi^i,p 2 ,i) is the per-slot sum-rate, given by 
R{pi,i,P 2 ,i) = max {[pk,i - "fk,i + ctjljA) (42a) 

71,1,72,1 

s.t. pk,i > lk,i >0, fc = 1, 2, (42b) 

7 i.i 72 ,z = 0. (42c) 

We remark that (ITO and (l42li are the finite-sized battery exten¬ 
sions of the power allocation and energy transfer problems in 
(fTTb and (fT 2 li . respectively. As in Section ITV-AI the solution to 
(l42l l is given by (fT4l l- (fT5l l. It remains to solve (HTt and identify 
the optimal {pk,i} and {ek,i}- Observing that (HTt is a convex 
program, we write the KKT optimality conditions 


dR{pi^i,P2,i) 

dpk,i 


N 

4" ^ ^ (Afc,n ldk,n) ‘Tk,i — 0, 


(43) 


N N 

^ {dk,n) Gik ^ Pj,n) lkk,i — 0, (44) 

n—i n—i 

\k,rSk,i = ll, hASk,^-E^l=(l, (45) 
Tk,iPk,i = 0, p.k,iik,i = 0, (46) 

for k,j = 1, 2, j k, and i = 1,... ,N. Here, Afc,i > 0 and 
I3k,i > 0 are the Lagrange multipliers for the constraints in 
(l4lbl i. and Tk^i > 0 and p,k,i > 0 are the Lagrange multipliers 
for the constraints in (I41cl l. For the water levels in (fTSl) . (03) 
yields 


/ N y 

'^k,i ~ I ^ ', {^k,n Pk,n) '^k,i 1 

\n=i / 


(47) 


For pk,i > 0, the optimal water levels Vk,i may increase 
only when the battery is empty, Sk,i = 0 , and decrease 
only when the battery is full, Sk,i = EJf°‘^. Omitting (04l l. 
this is the finite-sized battery extension of (fT6T l- (fT7l i. and can 
be solved by the iterative generalized directional water-filling 
algorithm where the water flow between time slots is limited to 
^max lEl. Note that energy cooperation over ek,i introduces 
the possibility of energy (and hence water) flow between 
the two nodes in addition to between two time slots. This 
flow dimension was first considered in the two dimensional 
directional water filling algorithm (2D-DWF) in Q Alg. 1] for 
nodes with infinite batteries. Here, we extend it to the finite¬ 
sized battery case, and simplify the algorithm significantly by 
observing the structure of optimal energy transfers {ek,i]- 


Lemma 5 If the generalized directional water-filling algo- 
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Fig. 7. Two dimensional directional water-filling with restricted transfers, 
with (a) initial water levels, (b) water levels after flow within each node, and 
(c) water levels after flow between the two nodes. The flow from Tj. to Tj 
is not allowed unless the battery of Tfc is full, as seen at * = 2 in (b). 


rithm with finite-sized batteries yields pj^i > 0 for some i, 
then efc,i = 0 is optimal, i.e., energy transfer from to Tj 
with the purpose of storage is not necessary in the optimal 
policy. 

Proof: Since {pk,i] is the output of the generalized 
directional water-hlling algorithm, there exists \k,i, (3k,i, and 
Tk,i that satisfy (l43T l. (1451) . and (l46l l. Given pjj > 0, the second 
sum term in (l44l) is equal to while the hrst sum term in 
(gii is greater than or equal to Vf. \ due to Tk,i > 0. By CSll, 
water levels always satisfy Vk,i > > akC(jVk,i, and 

hence (l44l i can be satished for some p,k,i > 0 by choosing 
=0. I 

The Lemma implies that it is sufficient to have a nonzero 
ek,i only when pjj = 0. We combine this insight with the 
condition in dTTl) . which implies that it is sufficient to have 
a nonzero ek,i only when the battery of Tk is full, and 
propose the 2D-DWF algorithm with restricted transfers. In 
this implementation, we modify the 2D-DWF algorithm by 
allowing water flow from Tk to Tj only if the battery of Tk is 
full and pjj = 0. We allow water flow until (l44li is satisfied for 
Pk.i = 0. In accordance with the battery capacity constraint, 
we also limit the water flow among neighboring time slots to 

^max 

The 2D-DWF algorithm with restricted transfers is demon¬ 
strated in Fig. |7] Initially, the entire harvested energy Ek,i is 
allocated to transmission, i.e., pk,i = Ek,i, and water levels 
are obtained from (fT^ . This state is depicted in Fig. |7^. Next, 
directional water flow is allowed for each user individually and 
in time only, i.e., flow in the vertical direction is not allowed. 
The taps marked with right facing arrows limit water flow 
to a maximum of between time slots. The resulting 

water levels are shown in Fig. |7]f. Finally, vertical water flow 
is allowed only in the time slots ending with a full battery, 
e.g., at i = 2 from Ti to T 2 , by turning on the taps marked 
with vertical arrows. Water flow from Ti to T2 continues until 
(l44l l is satisfied, yielding the optimal water levels in Fig. |7};. 
Recall that water flow from Tk to Tj, which represents ek,i, is 
only one component of transferred energy. Energy transfer 5k,i 
may be taking place at * = 1, 3 via the immediately consumed 
component yk,i, which are found using (fT4l i. 


Remark 2 The proof of Lemma 0 is independent of the 
objective function of m- Hence, the optimality of partially 
procrastinating policies, shown in Lemma 0 immediately ex¬ 
tends to the two-hop and multiple access models in SectionsW\ 
and m As a result, the 2D-DWF algorithm with restricted 
transfers can be used for these models as well, provided that 
the water levels are updated as in f l25l ) for the THC and i\30\l 
for the MAC. 


VIII. Numerical Results 

In our simulations of the three channel models, we consider 
a transmission period of = 100 time slots, a noise 
spectral density of Aq = 10“^® W/Hz and a bandwidth of 
W = 1 MHz for both nodes. Battery capacities are = 

pjmax _ Unless otherwise stated, the energy arrivals 

are generated uniformly and independently in [ 0 , 10 ] mJ, the 
channel power gains are hi = h 2 = —100 dB, and the energy 
transfer efficiency values are ai = a 2 = 0.5. These are 
typical system parameters similar to those in previous work 
Q-ii, ina, M- For the purpose of comparison with 
conventional power allocation policies, we also evaluate the 
performance of a constant power policy. Nodes employing 
the constant power policy attempt transmission with a transmit 
power equal to their average energy harvesting rate, i.e., pk,i = 
min{5'fc^i, whenever they are not in an energy outage. 

To verify that the difference in performance is not solely due to 
energy cooperation, we additionally allow energy cooperation 
between nodes employing the constant power policy for their 
consumed powers, i.e., pk,i = while 

allowing the optimal instantaneous energy transfers given by 
(fl4li . (|2^ and below (|2^ . 

A. EHEC Two-way Channel 

For the two-way channel, the average sum-throughput val¬ 
ues are plotted in Fig. [ 8 ] for the optimal policy with two- 
way energy cooperation, optimal policy without energy coop¬ 
eration, and the two constant power policies. The plots are 
obtained by varying the peak harvest rate of Ti, referred to 
as Eh, in [0,10] mJ, while Eij is distributed uniformly on 
[0,Eh]. It can be observed that energy cooperation yields a 
significant increase in performance, particularly since Ti is 
energy deprived compared to T 2 . In other evaluations not 
shown here, a similar insight is observed to hold when one 
node has a notably worse channel. Constant power policies, on 
the other hand, perform consistently worse than the respective 
optimal policies found via generalized directional water-filling. 

In most cases, one node is clearly at a disadvantage in terms 
of energy, and the direction of optimal energy transfer is usu¬ 
ally fixed, i.e., uni-directional energy transfer usually suffices 
to achieve the maximum throughput with energy cooperation. 
However, cases where bi-directional transfer outperforms uni¬ 
directional transfer in either direction are frequently observed. 
These include cases where the energy budgets and channel 
parameters of the nodes are comparable, where the energy- 
deprived node changes within the transmission duration, and 
where the battery capacity is reached at both nodes at different 
points in time. An example is shown in Fig. l9] for the same 
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Fig. 8. Sum-throughput versus peak harvested energy Ei^i for the two- 
way channel with and without energy transfer, compared with the heuristic 
constant power policy. 



Fig. 9. Sum-throughput for the two-way channel with one-way energy 
transfer, and without excess energy transfers i. 



Fig. 10. Sum-throughput versus transfer efficiency ai for the two-way 
channel. 



Fig. 11. Sum-throughput versus peak harvested energy Ei^i for the two- 
hop channel with and without energy transfer, compared with the heuristic 
constant power policy. 


channel parameters in Fig. [8] but with an energy arrival 
scenario where Ti is energy deprived for one half of the 
transmission, and T 2 is energy deprived for the other. Note that 
for low or high harvest rates for node Ti, the uni-directional 
energy transfer performs better in different directions, and both 
directions are essential to achieve the optimal throughput. 

Fig. m also shows the no excess energy transfers policy, 
where the excess energy transfers ek,i are forced to be zero 
while instantaneously consumed energy transfers, 'yk,i are 
chosen freely. The departure of this policy from the full 
cooperation case indicates that the excess energy transfers 
are necessary to find the optimal policy, whereas their impact 
on throughput is not as significant as the impact of instanta¬ 
neously consumed energy transfers. This departure can be seen 
more clearly in Fig.[T0l in which Eh = 10 mJ is fixed, and the 
transfer efficiency ai is varied in [0, ^]. We also remark that 
below a certain energy transfer efficiency, energy cooperation 
in the direction from Ti to T 2 is not necessary to achieve 
the optimal throughput. However, as energy transfer becomes 
more efficient, i.e., for ai > 0.1, the optimal throughput 
increases for both the uni-directional and bi-directional energy 


cooperation cases. 

B. EHEC Two-hop Channel 

We next provide numerical results for the two-hop channel, 
by varying the peak harvest rate of Ti in [0,10] mJ, in Fig. [TT] 
We observe that in this setup, Ti being energy deprived 
significantly hinders the performance in the absence of energy 
cooperation, since both Ti and T 2 need to have sufficient 
energy to transmit in order to have non-zero end-to-end 
throughput. Hence, in this case, energy cooperation is observed 
to be very useful for low Eh- Meanwhile, the performance of 
constant power policies are significantly worse. 

C. EHEC Multiple Access Channel 

Finally, we present numerical results for the two-user mul¬ 
tiple access channel, once again varying the peak harvest rate 
of Ti in [0,10] mJ. The channel gains for users 1 and 2 are 
hi = —100 dB and /i 2 = —110 dB, respectively. Performance 
of the optimal policies with and without energy transfer, and 
constant power policies, are shown in Fig. [TJ] It can be seen 
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Fig. 12. Sum-throughput versus peak harvested energy Ei^i for the multiple 
access channel with and without energy transfer, compared with the heuristic 
constant power policy. 

that when Ti, which has a better channel to the receiver that 
that of T 2 , is energy deprived, energy cooperation from T 2 to 
Ti significantly increases the throughput. 

IX. Conclusions 

In this work, we have identihed the jointly optimal transmit 
power and energy cooperation policies of energy harvesting 
channels for maximizing sum-throughput. This is done by 
identifying a class of policies that contain an optimal policy 
and that enable the decomposition of the problem into energy 
transfer allocation and consumed power allocation problems 
that can be solved in tandem. 

For the two-way channel, the two-hop channel, and the 
multiple access channel, we have found optimal procrasti¬ 
nating policies using variations of directional water-hlling. 
Although the algorithms yielding the optimal policies have the 
water-hlling analogy in common, these models display notable 
differences that also affect the resulting optimal policies. In 
particular, in the two-way channel, the two users’ communica¬ 
tion rates are unaffected by one another, and thus water-hlling 
is performed separately and iteratively for each transmitter. On 
the other hand, in the two-hop channel, the resulting water- 
hlling algorithm admits a two-huid interpretation. Finally, in 
the multiple access channel, the sum-rate is governed by the 
sum-power, and the problem reduces to a single transmitter 
counterpart with aggregate arrivals based on the efficiency of 
energy transfers. Overall, we have demonstrated how (par¬ 
tially) procrastinating policies simplify the joint optimization 
problem, both in the inhnite and hnite battery cases. Simu¬ 
lations have demonstrated that energy cooperation provides 
a notable increase in sum-throughput, particularly when one 
node is at a disadvantage in terms of channel power gains or 
harvesting rate, and the end-to-end throughput depends on both 
nodes being able to transmit. We have also noted that energy 
transfer in one direction is not always sufficient, and that bi¬ 
directional cooperation is needed to achieve the maximum 
sum-throughput. 


Optimality of procrastinating policies has recently proved 
to be a useful tool that extends to more involved channel 
models such as the diamond channel ll^ . It would therefore 
be interesting to extend these results to various other channels. 
Other future directions include considering offline policies 
for imperfect channel state information or imperfect energy 
harvest information, online policies inspired by the offline 
solutions presented in this work, extensions to models with 
channel fading where channel state information dynamics and 
availability at various parties can change the underlying rate 
regions, data buffers, delay constraints, storage inefficiencies, 
and processing energy cost. Lastly, we note that while we 
considered a deterministic approach with energy accumulation, 
transfer, and usage, a queueing treatment of the same remains 
an interesting future direction. 

Appendix A 
Proof of Lemma[3] 

Lemma [3 follows from the constraints (I12bb being linear, 
and the function in (I 12 al l being jointly concave in its 

arguments. Specifically, let the solution to (fTZt for [tt^] and 
[iffe] And and [5^], respectively. For consumed powers 
[oTTfc -f (1 — aj-TTfe] with 0 < a < 1 , the energy transfers 
[aSk -f (1 — a)6k] are feasible, and yield 

R{[aTTk + (1 - a)7ffe]) > aR{[TTk]) + (1 - a)i?([7ffe]), (48) 
due to the joint concavity of {pi,p 2 ) in pi and p 2 . 

Appendix B 
Proof of Lemma|4] 

Let {PkiJ^ki}’ ^ — li2, i = l,...,iV, be a solution 
to (l3^ . We will construct a partially procrastinating policy 
that is feasible. Let 4.i = k ^ 1,2. 
Starting from z = 1, we calculate 

7 /c,i = min|4,i,^| , (49) 

Sk,i+1 = ^k,i+l + ^k,i ~ Ik.i, (50) 

for k,j = 1, 2, j ^ k, and i = 1,... ,N. Note that {'jk.i} in 
(|4^ satisfy (l35] l by definition. Next, for z = 1,..., W, let 

7 fe,z = niax{0,7fc7- 7 ^ 7 }, k,j = 1,2, j ^ k. (51) 

This yields { 7 J j} that satisfy both (iTSl l and (l36l l. Let ^ = 0 
for k,j = 1, 2, j ^ k, and z = 1,..., A^. Starting from z = 1, 
we recalculate Sk.i from (l34l l using { 7 ^ ii i}> and calculate 
Sk,i using the recalculated Sk,i, i.e., 

Sk,i — Rk,i—1 “t” Rk,i Pk,i ^k.i “f (52) 

for k,j = 1,2 and j ^ k, while updating the optimal stored 
component ej ^ as 

e*fc_, = max{0,5fc.,-£;r"}- (53) 

Note that this immediately satisfies dJTl i for all k,j = 1,2 and 

z = l,...,iV. 

The process outlined above postpones energy transfers that 
are not immediately needed via (l49])- (l50l) . eliminates cases 
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of simultaneous bi-directional energy transfer via (ED, and 
transfers excess energy that is overflowing via (ED. The 
resulting policy, ^ partially procrastinating 

policy. Given that the original policy {pj feasible, 

{Pk i, 7fc z, i} feasible by construction. This policy is 

also optimal since the objective of (ED depends only on the 
transmit powers {pk,i], and the transmit powers {pjj} are 
equal in both policies. 
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